106 research outputs found
Tree pattern matching from regular tree expressions
In this work we deal with tree pattern matching over ranked trees, where the pattern set to be matched against is defined by a regular tree expression. We present a new method that uses a tree automaton constructed inductively from a regular tree expression. First we construct a special tree automaton for the regular tree expression of the pattern, which is in a sense a generalization of the Thompson automaton for strings. Then we run the constructed automaton on the subject tree. The time complexity of the pattern matching algorithm is stated in terms of the number of nodes of the subject tree and the size of the regular tree expression. The novelty of this contribution, besides the low time complexity, is that the set of patterns can be infinite, since we use regular tree expressions to represent patterns.
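As a rough illustration of running a tree automaton over a subject tree, the following sketch evaluates a small nondeterministic bottom-up tree automaton at every node and records the nodes whose subtree is accepted. It is not the Thompson-like construction from the abstract and makes no claim about its complexity bound; the tree encoding, the `rules` table, the state names, and the function `run` are all assumptions made for this example.

```python
from itertools import product

def run(tree, rules, finals, matches=None, path=()):
    """Return the states reachable at the root of `tree`.

    Trees are tuples (symbol, child_1, ..., child_k). `rules` maps
    (symbol, tuple_of_child_states) to a set of states. Paths of nodes
    whose state set meets `finals` are appended to `matches`.
    """
    symbol, *children = tree
    child_sets = [
        run(child, rules, finals, matches, path + (i,))
        for i, child in enumerate(children)
    ]
    states = set()
    # For a leaf, product() yields one empty tuple, so leaf rules use ().
    for combo in product(*child_sets):
        states |= rules.get((symbol, combo), set())
    if matches is not None and states & finals:
        matches.append(path)
    return states

# Hypothetical automaton for the infinite pattern set g(a), g(g(a)), ...
rules = {
    ("a", ()): {"qa"},
    ("g", ("qa",)): {"qg"},
    ("g", ("qg",)): {"qg"},
}
finals = {"qg"}

# Subject tree f(g(g(a)), b); nodes are addressed by child-index paths.
subject = ("f", ("g", ("g", ("a",))), ("b",))
matches = []
root_states = run(subject, rules, finals, matches)
print(matches)
```

Each node is visited once, and the pattern set accepted here is infinite even though the rule table is finite, which is the property the abstract highlights.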
SAMOS - A framework for model analytics and management
The increased popularity and adoption of model-* engineering paradigms, such as model-driven and model-based engineering, leads to an increase in the number of models, metamodels, model transformations and other related artifacts. This calls for automated techniques to analyze large collections of those artifacts to manage model-* ecosystems. SAMOS is a framework to address this challenge: it treats model-* artifacts as data, and applies various techniques—ranging from information retrieval to machine learning—to analyze those artifacts in a holistic, scalable and efficient way. Such analyses can help to understand and manage those ecosystems
Weak factor automata : the failure of failure factor oracles?
In indexing of, and pattern matching on, DNA and text sequences, it is often important to represent all factors of a
sequence. One efficient, compact representation is the factor oracle (FO). At the same time, any classical deterministic
finite automaton (DFA) can be transformed to a so-called failure one (FDFA), which may use failure transitions to replace
multiple symbol transitions, potentially yielding a more compact representation. We combine the two ideas and directly
construct a failure factor oracle (FFO) from a given sequence, in contrast to ex post facto transformation to an FDFA. The
algorithm is suitable for both short and long sequences. We empirically compared the resulting FFOs and FOs on number
of transitions for many DNA sequences of lengths 4 - 512, showing gains of up to 10% in total number of transitions, with
failure transitions also taking up less space than symbol transitions. The resulting FFOs can be used for indexing, as
well as in a variant of the FO-using backward oracle matching algorithm. We discuss and classify this pattern matching
algorithm in terms of the keyword pattern matching taxonomies of Watson, Cleophas and Zwaan. We also empirically
compared the use of FOs and FFOs in such backward reading pattern matching algorithms, using both DNA and natural
language (English) data sets. The results indicate that the decrease in pattern matching performance of an algorithm using
an FFO instead of an FO may outweigh the gain in representation space by using an FFO instead of an FO.
http://www.journals.co.za/ej/ejour_comp.html
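For readers unfamiliar with the plain factor oracle that the abstract builds on, here is a minimal sketch of the standard online construction (due to Allauzen, Crochemore and Raffinot). It builds an ordinary FO, not the failure factor oracle (FFO) described above; the function names `build_factor_oracle` and `accepts` and the sample word are assumptions chosen for illustration.

```python
def build_factor_oracle(word):
    """Build a factor oracle for `word` with states 0..len(word).

    Returns (trans, supply): trans[s] maps a symbol to a target state,
    and supply[s] is the supply link used during construction.
    """
    n = len(word)
    trans = [dict() for _ in range(n + 1)]
    supply = [-1] * (n + 1)
    for i, c in enumerate(word):
        trans[i][c] = i + 1              # internal transition i -> i+1
        k = supply[i]
        # Add external transitions while following supply links.
        while k > -1 and c not in trans[k]:
            trans[k][c] = i + 1
            k = supply[k]
        supply[i + 1] = 0 if k == -1 else trans[k][c]
    return trans, supply

def accepts(trans, s):
    """Return True if the oracle can read `s` from state 0 (all states accept)."""
    state = 0
    for c in s:
        if c not in trans[state]:
            return False
        state = trans[state][c]
    return True

word = "abbbaab"
trans, supply = build_factor_oracle(word)
num_transitions = sum(len(t) for t in trans)
```

For this sample word the oracle accepts every factor, but it also accepts `aba`, which is not a factor of `abbbaab`. That over-acceptance is why a factor oracle is called an oracle rather than an exact factor automaton.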
Current Trends in Digital Twin Development, Maintenance, and Operation: An Interview Study
Digital twins (DT) are often defined as a pairing of a physical entity and a corresponding virtual entity mimicking certain aspects of the former depending on the use-case. In recent years, this concept has facilitated numerous use-cases ranging from design to validation and predictive maintenance of large and small high-tech systems. Although growing in popularity in both industry and academia, digital twins and the methodologies for developing and maintaining them differ vastly. To better understand these differences and similarities, we performed a semi-structured interview research study with 19 professionals from industry and academia who are closely associated with different lifecycle stages of the corresponding digital twins. In this paper, we present our analysis and findings from this study, which is based on eight research questions (RQ). We present our findings per research question. In general, we identified an overall lack of uniformity in terms of the understanding of digital twins and used tools, techniques, and methodologies for their development and maintenance. Furthermore, considering that digital twins are software intensive systems, we recognize a significant growth potential for adopting more software engineering practices, processes, and expertise in various stages of a digital twin's lifecycle
Forest FIRE and FIRE wood : tools for tree automata and tree algorithms
Pattern matching, acceptance, and parsing algorithms on node-labeled, ordered, ranked trees ('tree algorithms') are important for applications such as instruction selection and tree transformation/term rewriting. Many such algorithms have been developed. They often are based on results from such algorithms on words or generalizations thereof using finite (tree) automata. Regrettably no coherent, extensive toolkit of such algorithms and automata existed, complicating their use.
Our toolkit FOREST FIRE contains many such algorithms and automata constructions. It is accompanied by the graphical user interface (GUI) FIRE WOOD. The toolkit and GUI provide a useful environment for experimenting with and comparing the algorithms. In this tool paper we give an overview of the toolkit and GUI, their context and design rationale, and mention some results obtained with them
Deriving the Boyer-Moore-Horspool algorithm
The Post-Proceedings of this Festschrift will be formally published in The South African Computer Journal number 41